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Method for distinguishing AML-specific FLT3 length mutations from TKD 

mutations 

The present invention is directed to a method for distinguishing AML-specific 
FLT3 length mutations from TKD mutations by determining the expression level of 
selected marker genes. 

Leukemias are classified into four different groups or types: acute myeloid (AML), 
acute lymphatic (ALL), chronic myeloid (CML) and chronic lymphatic leukemia 
(CLL). Within these groups, several subcategories can be identified further using a 
panel of standard techniques as described below. These different subcategories in 
leukemias are associated with varying clinical outcome and therefore are the basis 
for different treatment strategies. The importance of highly specific classification 
may be illustrated in detail further for the AML as a very heterogeneous group of 
diseases. Effort is aimed at identifying biological entities and to distinguish and 
classify subgroups of AML which are associated with a favorable, intermediate or 
unfavorable prognosis, respectively. In 1976, the FAB classification was proposed 
by the French-American-British co-operative group which was based on 
cytomorphology and cytochemistry in order to separate AML subgroups according 
to the morphological appearance of blasts in the blood and bone marrow. In 
addition, it was recognized that genetic abnormalities occurring in the leukemic 
blast had a major impact on the morphological picture and even more on the 
prognosis. So far, the karyotype of the leukemic blasts is the most important 
independent prognostic factor regarding response to therapy as well as survival. 

Usually, a combination of methods is necessary to obtain the most important 
information in leukemia diagnostics: Analysis of the morphology and 
25 cytochemistry of bone marrow blasts and peripheral blood cells is necessary to 

establish the diagnosis. In some cases the addition of immunophenotyping is 
mandatory to separate very undifferentiated AML from acute lymphoblastic 
leukemia and CLL. Leukemia subtypes investigated can be diagnosed by 
cytomorphology alone, only if an expert reviews the smears. However, a genetic 
30 analysis based on chromosome analysis, fluorescence in situ hybridization or RT- 
PCR and immunophenotyping is required in order to assign all cases into the right 
category. The aim of these techniques besides diagnosis is mainly to determine the 
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prognosis of the leukemia. A major disadvantage of these methods, however, is that 
viable cells are necessary as the cells for genetic analysis have to divide in vitro in 
order to obtain metaphases for the analysis. Another problem is the long time of 72 
hours from receipt of the material in the laboratory to obtain the result. 
Furthermore, great experience in preparation of chromosomes and even more in 
analyzing the karyotypes is required to obtain the correct result in at least 90% of 
cases. Using these techniques in combination, hematological malignancies in a first 
approach are separated into chronic myeloid leukemia (CML), chronic lymphatic 
(CLL), acute lymphoblastic (ALL), and acute myeloid leukemia (AML). Within 
the latter three disease entities several prognostically relevant subtypes have been 
established. As a second approach this further sub-classification is based mainly on 
genetic abnormalities of the leukemic blasts and clearly is associated with different 
prognoses. 

The sub-classification of leukemias becomes increasingly important to guide 
therapy. The development of new, specific drugs and treatment approaches requires 
the identification of specific subtypes that may benefit from a distinct therapeutic 
protocol and, thus, can improve outcome of distinct subsets of leukemia. For 
example, the new therapeutic drug (STI571, Imatinib) inhibits the CML specific 
chimeric tyrosine kinase BCR-ABL generated from the genetic defect observed in 
CML, the BCR-ABL-rearrangement due to the translocation between 
chromosomes 9 and 22 (t(9;22) (q34; qll)). In patients treated with this new drug, 
the therapy response is dramatically higher as compared to all other drags that had 
been used so far. Another example is the subtype of acute myeloid leukemia AML 
M3 and its variant M3v both with karyotype t(15;17)(q22;* qll-12). The 
introduction of a new drug (all-trans retinoic acid - ATRA) has improved the 
outcome in this subgroup of patients from about 50% to 85 % long-term survivors. 
As it is mandatory for these patients suffering from these specific leukemia 
subtypes to be identified as fast as possible so that the best therapy can be applied, 
diagnostics today must accomplish sub-classification with maximal precision. Not 
only for these subtypes but also for several other leukemia subtypes different 
treatment approaches could improve outcome. Therefore, rapid and precise 
identification of distinct leukemia subtypes is the future goal for diagnostics. 
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Thus, the technical problem underlying the present invention was to provide means 
for leukemia diagnostics which overcome at least some of the disadvantages of the 
prior art diagnostic methods, in particular encompassing the time-consuming and 
unreliable combination of different methods and which provides a rapid assay to 
5 unambiguously distinguish one AML subtype from another, e.g. by genetic 
analysis. 

According to Golub et al. (Science, 1999, 286, 531-7), gene expression profiles can 
be used for class prediction and discriminating AML from ALL samples. However, 
for the analysis of acute leukemias the selection of the two different subgroups was 
performed using exclusively morphologic-phenotypical criteria. This was only 
descriptive and does not provide deeper insights into the pathogenesis or the 
underlying biology of the leukemia. The approach reproduces only very basic 
knowledge of cytomorphology and intends to differentiate classes. The data is not 
sufficient to predict prognostically relevant cytogenetic aberrations. 

Furthermore, the international application WO- A 03/039443 discloses marker 
genes the expression levels of which are characteristic for certain leukemia, e.g. 
AML subtypes and additionally discloses methods for differentiating between the 
20 subtype of AML cells by determining the expression profile of the disclosed 
marker genes. However, WO- A 03/039443 does not provide guidance which set of 
distinct genes discriminate between two subtypes and, as such, can be routineously 
taken in order to distinguish one AML subtype from another. 
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25 The problem is solved by the present invention, which provides a method for 
distinguishing AML-specific FLT3 length mutations from TKD mutations in a 
sample, the method comprising determining the expression level of markers 
selected from the markers identifiable by their Asymetrix Identification Numbers 
(affy ID) as defined in Tables 1, and/or 2, 

30 wherein 

a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 1.1 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 1.1 having a positive fc value, 
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is indicative for the presence of AML_D835 when AML_D835 is 
distinguished from all other subtypes, 

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one 
5 of the numbers 1 to 50 of Table 1.2 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 1 .2 having a positive fc value, 

is indicative for the presence of AML_Double when AMLJDouble is 
distinguished from all other subtypes, 

1 0 and/or wherein 

a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 1.3 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 1 .3 having a positive fc value, 

15 is indicative for the presence of AMLJ3tatus-l when AML_Status-l is 

distinguished from all other subtypes, 

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 1.4 having a negative fc value, and/or 

20 a higher expression of at least one polynucleotide defined by at least 

one of the numbers 1 to 50 of Table 1.4 having a positive fc value, 

is indicative for the presence of AMLJ3tatus-2 when AML_Status-2 is 
distinguished from all other subtypes, 

and/or wherein 

25 a lower expression of at least one polynucleotide defined by at least one 

of the numbers 1 to 50 of Table 1.5 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 1.5 having a positive fc value, 

is indicative for the presence of AMLJStatus-3 when AML_Status-3 is 
30 distinguished from all other subtypes, 

and/or wherein 
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a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 1.6 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 1 .6 having a positive fc value, 

5 is indicative for the presence of AML_Status-4 when AML_Status-4 is 

distinguished from all other subtypes, 

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 1.7 having a negative fc value, and/or 

10 a higher expression of at least one polynucleotide defined by at least 

one of the numbers 1 to 50 of Table 1.7 having a positive fc value, 

is indicative for the presence of AML_Status-5 when AML_Status-5 is 
distinguished from all other subtypes, 

and/or wherein 

1 5 a lower expression of at least one polynucleotide defined by at least one 

of the numbers 1 to 50 of Table 1.8 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 1.8 having a positive fc value, 

is indicative for the presence of AML_normal when AML_normal is 
20 distinguished from all other subtypes, 

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 2.1 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
25 one of the numbers 1 to 50 of Table 2.1 having a positive fc value, 

is indicative for the presence of AML_D835 when AMLJD835 is 
distinguished from AMLJDouble, 

and/or wherein 



30 



a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 2.2 having a negative fc value, and/or 



WO 2005/043168 



PCT/EP2004/012470 



-6- 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 2.2 having a positive fc value, 

is indicative for the presence of AML_D835 when AMLJ3835 is 
distinguished from AML_Status-l, 

5 and/or wherein 

a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 2.3 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 2.3 having a positive fc value, 

10 is indicative for the presence of AMLJD835 when AMLJD835 is 

distinguished from AML_Status-2, 

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 2.4 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 2.4 having a positive fc value, 

is indicative for the presence of AML_D835 when AMLJD835 is 
distinguished from AML_Status-3, 

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 2.5 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 2.5 having a positive fc value, 

is indicative for the presence of AMLJ3835 when AML_D835 is 
distinguished from AML_Status-4, 

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 2.6 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 2.6 having a positive fc value, 
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is indicative for the presence of AMLJD835 when AML_D835 is 
distinguished from AML_Status-5, 

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one 
5 of the numbers 1 to 50 of Table 2.7 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 2.7 having a positive fc value, 

is indicative for the presence of AMLJD835 when AMLJD835 is 
distinguished from AML_normal, 

1 0 and/or wherein 

a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 2.8 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 2.8 having a positive fc value, 

15 is indicative for the presence of AML_Double when AML_Double is 

distinguished from AMLJStatus-1, 

* 

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 2.9 having a negative fc value, and/or 

20 a higher expression of at least one polynucleotide defined by at least 

one of the numbers 1 to 50 of Table 2.9 having a positive fc value, 

is indicative for the presence of AML_Double when AML_Double is 
distinguished from AML_Status-2, 

and/or wherein 

25 a lower expression of at least one polynucleotide defined by at least one 

of the numbers 1 to 50 of Table 2.10 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 2.10 having a positive fc value, 

is indicative for the presence of AML_Double when AML__Double is 
3 0 distinguished from AML_Status-3 , 

and/or wherein 
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a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 2.1 1 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 2-11 having a positive fc value, 

5 is indicative for the presence of AMLJDouble when AMLJDouble is 

distinguished from AML_Status-4, 

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 2.12 having a negative fc value, and/or 

10 a higher expression of at least one polynucleotide defined by at least 

one of the numbers 1 to 50 of Table 2.12 having a positive fc value, 

is indicative for the presence of AMLJDouble when AML_Double is 
distinguished from AML_Status-5, 

and/or wherein 

15 a lower expression of at least one polynucleotide defined by at least one 

of the numbers 1 to 50 of Table 2.13 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 2.13 having a positive fc value, 

is indicative for the presence of AMLJDouble when AMLJDouble is 
20 distinguished from AML_normal, 

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 2.14 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
25 one of the numbers 1 to 50 of Table 2.14 having a positive fc value, 

is indicative for the presence of AML_Status-l when AML Status- 1 is 
distinguished from AMLJ3tatus-2, 

and/or wherein 



30 



a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 2.15 having a negative fc value, and/or 
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a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 2.15 having a positive fc value, 

is indicative for the presence of AMLjStatus-1 when AMLJStatus-1 is 
distinguished from AMLJStatus-3, 

5 and/or wherein 

a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 2.16 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 2.16 having a positive fc value, 

10 is indicative for the presence of AMLJStatus-1 when AMLJStatus-1 is 

distinguished from AMLJStatus-4, 

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 2.17 having a negative fc value, and/or 

15 a higher expression of at least one polynucleotide defined by at least 

one of the numbers 1 to 50 of Table 2.17 having a positive fc value, 

is indicative for the presence of AMLJStatus-1 when AML_Status-l is 
distinguished from AMLJStatus-5, 

and/or wherein 

20 a lower expression of at least one polynucleotide defined by at least one 

of the numbers 1 to 50 of Table 2.18 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 2.18 having a positive fc value, 

is indicative for the presence of AMLJStatus-1 when AML_Status-l is 
25 distinguished from AML_normal, 

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 2.19 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
30 one of the numbers 1 to 50 of Table 2.19 having a positive fc value, 
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is indicative for the presence of AML_Status-2 when AMLJStatus-2 is 
distinguished from AML_Status-3, 

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one 
5 of the numbers 1 to 50 of Table 2.20 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 2.20 having a positive fc value, 

is indicative for the presence of AML_Status-2 when AMLJStatus-2 is 
distinguished from AMLJStatus-4, 

1 0 and/or wherein 

a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 2.21 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 2.21 having a positive fc value, 

15 is indicative for the presence of AMLJStatus-2 when AMLJStatus-2 is 

distinguished from AML_Status-5 , 

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 2.22 having a negative fc value, and/or 

20 a higher expression of at least one polynucleotide defined by at least 

one of the numbers 1 to 50 of Table 2.22 having a positive fc value, 

is indicative for the presence of AMLJStatus-2 when AMLJStatus-2 is 
distinguished from AMLjaormal, 

and/or wherein 

25 a lower expression of at least one polynucleotide defined by at least one 

of the numbers 1 to 50 of Table 2.23 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 2.23 having a positive fc value, 

is indicative for the presence of AML_Status-3 when AML_Status-3 is 
3 0 distinguished from AML JStatus-4, 

* 

and/or wherein 
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a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 2.24 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 2.24 having a positive fc value, 

is indicative for the presence of AMLJStatus-3 when AML_Status-3 is 
distinguished from AMLJStatus-5, 

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 2.25 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 2.25 having a positive fc value, 

is indicative for the presence of AMLJStatus-3 when AML_Status-3 is 
distinguished from AMLnormal, 

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 2.26 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 2.26 having a positive fc value, 

is indicative for the presence of AMLJStatus-4 when AMLjStatus-4 is 
distinguished from AML_Status-5, 

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 2.27 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 2.27 having a positive fc value, 

is indicative for the presence of AML_Status-4 when AMLJStatus-4 is 
distinguished from AML normal, 

and/or wherein 



a lower expression of at least one polynucleotide defined by at least one 
of the numbers 1 to 50 of Table 2.28 having a negative fc value, and/or 



WO 2005/043168 



PCT/EP2004/012470 



- 12- 

a higher expression of at least one polynucleotide defined by at least 
one of the numbers 1 to 50 of Table 2.28 having a positive fc value, 

is indicative for the presence of AML_Status-5 when AML JStatus-5 is 
distinguished from AMLjaormal. 

5 

FLT3 stands for FMS-like tyrosine kinase 3. TKD stand for tyrosin kinase domain 
of FLT3. 

Two mayor types of mutations in the FLT3-Gene have been described. 

10 

1) Length mutations in the juxtamembrane domain (FLT3-LM, status 1, 2, 3, 4, 5) 

2) point mutations in codons D835 or 1836 or deletions of 1836 in the tyrosine 
kinase domain (all coded as D835, refers to TKD mutation). 

3) "Double" means that both types of mutations were found in a single patient. 

15 

As used herein, the abbreviations used above apply for the following AML 
subtypes (see Also Example 1): 

1) AML normal (normal karyotype) and no FLT3 mutation, 

2) AML_status 1 : FLT3-LM/WT (wildtype) ratio <0.3, 
20 3) AML_status 2: ratio 0.7-1.1 , 

4) AML_status 3: ratio of >1.2=partial loss of WT (wild type), 

5) AML_status 4: total loss of WT , 

6) AML_status 5: two or more low status mutations 

7) AMLD835: D835/TKD mutation (mutation in the tyrosine kinase domain 
25 of FLT3) 

8) AML_Double: mutations D835/TKD and FLT3-LM 

As used herein, "all other subtypes'* refer to the subtypes of the present invention, 
i.e. if one subtype is distinguished from "all other subtypes", it is distiguished from 
30 all other subtypes contained in the present invention. 

According to the present invention, a "sample" means any biological material 
containing genetic information in the form of nucleic acids or proteins obtainable 
or obtained from an individual. The sample includes e.g. tissue samples, cell 
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samples, bone marrow and/or body fluids such as blood, saliva, semen. Preferably, 
the sample is blood or bone marrow, more preferably the sample is bone marrow. 
The person skilled in the art is aware of methods, how to isolate nucleic acids and 
proteins from a sample. A general method for isolating and preparing nucleic acids 
from a sample is outlined in Example 3. 

According to the present invention, the term "lower expression" is generally 
assigned to all by numbers and Affymetrix ID. definable polynucleotides the t- 
values and fold change (fc) values of which are negative, as indicated in the Tables. 
Accordingly, the term "higher expression" is generally assigned to all by numbers 
and Affymetrix ID. definable polynucleotides the t-values and fold change (fc) 
values of which are positive. 

According to the present invention, the term "expression" refers to the process by 
which mRNA or a polypeptide is produced based on the nucleic acid sequence of a 
gene, i.e. expression" also includes the formation of mRNA upon transcription. In 
accordance with the present invention, the term determining the expression level" 
preferably refers to the determination of the level of expression, namely of the 
markers. 

Generally, "marker" refers to any genetically controlled difference which can be 
used in the genetic analysis of a test versus a control sample, for the purpose of 
assigning the sample to a defined genotype or phenotype. As used herein, 
"markers" refer to genes which are differentially expressed in, e.g., different AML 
subtypes. The markers can be defined by their gene symbol name, their encoded 
protein name, their transcript identification number (cluster identification number), 
the data base accession number, public accession number or GenBank identifier or, 
as done in the present invention, Affymetrix identification number, chromosomal 
location, UniGene accession number and cluster type, LocusLink accession number 
(see Examples and Tables). 

The Affymetrix identification number (affy ID) is accessible for anyone and the 
person skilled in the art by entering the "gene expression omnibus" internet page of 
the National Center for Biotechnology Information (NCBI) 
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(http://wvm.ncbi.11lm.nih.gov/geo/). In particular, the affy CD's of the 
polynucleotides used for the method of the present invention are derived from the 
so-called U133 chip. The sequence data of each identification number can be 
viewed at http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GPL96 

5 

Generally, the expression level of a marker is determined by the determining the 
expression of its corresponding '^polynucleotide" as described hereinafter. 

According to the present invention, the term ^polynucleotide" refers, generally, to a 
10 DNA, in particular cDNA, or RNA, in particular a cRNA, or a portion thereof or a 
polypeptide or a portion thereof. In the case of RNA (or cDNA), the polynucleotide 
is formed upon transcription of a nucleotide sequence which is capable of 
expression. The polynucleotide fragments refer to fragments preferably of between 
at least 8, such as 10, 12, 15 or 18 nucleotides and at least 50, such as 60, 80, 100, 
15 200 or 300 nucleotides in length, or a complementary sequence thereto, 

representing a consecutive stretch of nucleotides of a gene, cDNA or mRNA. In 
other terms, polynucleotides include also any fragment (or complementary 
sequence thereto) of a sequence derived from any of the markers defined above as 
long as these fragments unambiguously identify the marker. 

20 

The determination of the expression level may be effected at the transcriptional or 
translational level, i.e. at the level of mRNA or at the protein level. Protein 
fragments such as peptides or polypeptides advantageously comprise between at 
least 6 and at least 25, such as 30, 40, 80, 100 or 200 consecutive amino acids 
25 representative of the corresponding full length protein. Six amino acids are 
generally recognized as the lowest peptidic stretch giving rise to a linear epitope 
recognized by an antibody, fragment or derivative thereof. Alternatively, the 
proteins or fragments thereof may be analysed using nucleic acid molecules 
specifically binding to three-dimensional structures (aptamers). 

30 

Depending on the nature of the polynucleotide or polypeptide, the determination of 
the expression levels may be effected by a variety of methods. For determining and 
detecting the expression level, it is preferred in the present invention that the 
polynucleotide, in particular the cRNA, is labelled. 
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The labelling of the polynucleotide or a polypeptide can occur by a variety of 
methods known to the skilled artisan. The label can be fluorescent, 
chemiluminescent, bioluminescent, radioactive (such as 3 H or 32 P). The labelling 
5 compound can be any labelling compound being suitable for the labelling of 
polynucleotides and/or polypeptides. Examples include fluorescent dyes, such as 
fluorescein, dichlorofluorescein, hexachlorofluorescein, BODIPY variants, ROX, 
tetramethylrhodamin, rhodamin X, Cyanine-2, Cyanine-3, Cyanine-5, Cyanine-7, 
IRD40, FluorX, Oregon Green, Alexa variants (available e.g. from Molecular 

10 Probes or Amersham Biosciences) and the like, biotin or biotinylated nucleotides, 
digoxigenin, radioisotopes, antibodies, enzymes and receptors. Depending on the 
type of labelling, the detection is done via fluorescence measurements, conjugation 
to streptavidin and/or avidin, antigen-antibody- and/or antibody-antibody- 
interactions, radioactivity measurements, as well as catalytic and/or receptor/ligand 

15 interactions. Suitable methods include the direct labelling (incorporation) method, 
the amino-modified (amino-allyl) nucleotide method (available e.g. from Ambion), 
and the primer tagging method (DNA dendrimer labelling, as kit available e.g. 
from Genisphere). Particularly preferred for the present invention is the use of 
biotin or biotinylated nucleotides for labelling, with the latter being directly 

20 incorporated into, e.g. the cRNA polynucleotide by in vitro transcription. 



If the polynucleotide is mRNA, cDNA may be prepared into which a detectable 
label, as exemplified above, is incorporated. Said detectably labelled cDNA, in 
single-stranded form, may then be hybridised, preferably under stringent or highly 

25 stringent conditions to a panel of single-stranded oligonucleotides representing 

different genes and affixed to a solid support such as a chip. Upon applying 
appropriate washing steps, those cDNAs will be detected or quantitatively detected 
that have a counterpart in the oligonucleotide panel. Various advantageous 
embodiments of this general method are feasible. For example, the mRNA or the 

30 cDNA may be amplified e.g. by polymerase chain reaction, wherein it is 
preferable, for quantitative assessments, that the number of amplified copies 
corresponds relative to further amplified mRNAs or cDNAs to the number of 
mRNAs originally present in the cell. In a preferred embodiment of the present 
invention, the cDNAs are transcribed into cRNAs prior to the hybridisation step 

35 wherein only in the transcription step a label is incorporated into the nucleic acid 
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and wherein the cRNA is employed for hybridisation. Alternatively, the label may 
be attached subsequent to the transcription step. 



Similarly, proteins from a cell or tissue under investigation may be contacted with 
5 a panel of aptamers or of antibodies or fragments or derivatives thereof The 
antibodies etc. may be affixed to a solid support such as a chip. Binding of proteins 
indicative of an AML subtype may be verified by binding to a detectably labelled 
secondary antibody or aptamer. For the labelling of antibodies, it is referred to 
Harlow and Lane, "Antibodies, a laboratory manual", CSH Press, 1988, Cold 

1 0 Spring Harbor. Specifically, a minimum set of proteins necessary for diagnosis of 
all AML subtypes may be selected for creation of a protein array system to make 
diagnosis on a protein lysate of a diagnostic bone marrow sample directly. Protein 
Array Systems for the detection of specific protein expression profiles already are 
available (for example: Bio-Plex, BIORAD, Munchen, Germany). For this 

15 application preferably antibodies against the proteins have to be produced and 
immobilized on a platform e.g. glasslides or microtiterplates. The immobilized 
antibodies can be labelled with a reactant specific for the certain target proteins as 
discussed above. The reactants can include enzyme substrates, DNA, receptors, 
antigens or antibodies to create for example a capture sandwich immunoassay. 

20 

For reliably distinguishing AML-specific FLT3 length mutations from TKD 
mutations it is useful that the expression of more than one of the above defined 
markers is determined. As a criterion for the choice of markers, the statistical 
significance of markers as expressed in q or p values based on the concept of the 
25 false discovery rate is determined. In doing so, a measure of statistical significance 
called the q value is associated with each tested feature. The q value is similar to the 
p value, except it is a measure of significance in terms of the false discovery rate 
rather than the false positive rate (Storey JD and Tibshirani R. Proc.Natl.Acad.Sci., 
2003, Vol. 100:9440-5. 

30 

In a preferred embodiment of the present invention, markers as defined in Tables 1- 
2 having a q-value of less than, 3E-02, less than 3E-06, more preferred less than 
1.5E-09, most preferred less than 1.5E-11, less than 1.5E-20, less than 1.5E-30, are 
measured. 



35 
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Of the above defined markers, the expression level of at least two, preferably of at 
least ten, more preferably of at least 25, most preferably of 50 of at least one of the 
Tables of the markers is determined. 

5 In another preferred embodiment, the expression level of at least 2, of at least 5, of 
at least 10 out of the markers having the numbers 1 - 10, 1-20, 1-40, 1-50 of at 
least one of the Tables are measured. 

The level of the expression of the „marker", i.e. the expression of the 
10 polynucleotide is indicative of the AML subtype of a cell or an organism. The level 
of expression of a marker or group of markers is measured and is compared with 
the level of expression of the same marker or the same group of markers from other 
cells or samples. The comparison may be effected in an actual experiment or in 
silico. When the expression level also referred to as expression pattern or 
15 expression signature (expression profile) is measurably different, there is according 
to the invention a meaningful difference in the level of expression. Preferably the 
difference at least is 5 %, 10% or 20%, more preferred at least 50% or may even be 
as high as 75% or 100%. More preferred the difference in the level of expression is 
at least 200%, i.e. two fold, at least 500%, i.e. five fold, or at least 1000%, i.e. 10 
20 fold. 

Accordingly, the expression level of markers expressed lower in a first subtype 
than in at least one second subtype, which differs from the first subtype, is at least 
5 %, 10% or 20%, more preferred at least 50% or may even be 75% or 100%, i.e. 

25 2-fold lower, preferably at least 10-fold, more preferably at least 50-fold, and most 
preferably at least 100-fold lower in the first subtype. On the other hand, the 
expression level of markers expressed higher in a first subtype than in at least one 
second subtype, which differs from the first subtype, is at least 5 %, 10% or 20%, 
more preferred at least 50% or may even be 75% or 100%, i.e. 2-fold higher, 

30 preferably at least 10-fold, more preferably at least 50-fold, and most preferably at 
least 100-fold higher in the first subtype. 

In another embodiment of the present invention, the sample is derived from an 
individual having leukemia, preferably AML. 

35 
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For the method of the present invention it is preferred if the polynucleotide the 
expression level of which is determined is in form of a transcribed polynucleotide. 
A particularly preferred transcribed polynucleotide is an mRNA, a cDNA and/or a 
cRNA, with the latter being preferred. Transcribed polynucleotides are isolated 
5 from a sample, reverse transcribed and/or amplified, and labelled, by employing 
methods well-known the person skilled in the art (see Example 3). In a preferred 
embodiment of the methods according to the invention, the step of deter m ining the 
expression profile further comprises amplifying the transcribed polynucleotide. 

10 In order to determine the expression level of the transcribed polynucleotide by the 
method of the present invention, it is preferred that the method comprises 
hybridizing the transcribed polynucleotide to a complementary polynucleotide, or a 
portion thereof, under stringent hybridization conditions, as described hereinafter. 

15 The term "hybridizing" means hybridization under conventional hybridization 
conditions, preferably under stringent conditions as described, for example, in 
Sambrook, J., et al., in "Molecular Cloning: A Laboratory Manual" (1989), Eds. J. 
Sambrook, E. F. Fritsch and T. Maniatis, Cold Spring Harbour Laboratory Press, 
Cold Spring Harbour, NY and the further definitions provided above. Such 

20 conditions are, for example, hybridization in 6x SSC, pH 7.0 / 0.1% SDS at about 
45°C for 18-23 hours, followed by a washing step with 2x SSC/0.1% SDS at 50°C. 
In order to select the stringency, the salt concentration in the washing step can for 
example be chosen between 2x SSC/0.1% SDS at room temperature for low 
stringency and 0.2x SSC/0.1% SDS at 50°C for high stringency. In addition, the 

25 temperature of the washing step can be varied between room temperature, ca. 22°C, 
for low stringency, and 65 °C to 70° C for high stringency. Also contemplated are 
polynucleotides that hybridize at lower stringency hybridization conditions. 
Changes in the stringency of hybridization and signal detection are primarily 
accomplished through the manipulation, preferably of formamide concentration 

30 (lower percentages of formamide result in lowered stringency), salt conditions, or 
temperature. For example, lower stringency conditions include an overnight 
incubation at 37°C in a solution comprising 6X SSPE (20X SSPE = 3M NaCl; 
0.2M NaH2P04; 0.02M EDTA, pH 7.4), 0.5% SDS, 30% formamide, 100 mg/ml 
salmon sperm blocking DNA, followed by washes at 50°C with 1 X SSPE, 0.1% 

35 SDS. In addition, to achieve even lower stringency, washes performed following 
stringent hybridization can be done at higher salt concentrations (e.g. 5x SSC). 
Variations in the above conditions may be accomplished through the inclusion 
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and/or substitution of alternate blocking reagents used to suppress background in 
hybridization experiments. The inclusion of specific blocking reagents may require 
modification of the hybridization conditions described above, due to problems with 
compatibility. 

5 

"Complementary" and "complementarity", respectively, can be described by the 
percentage, i.e. proportion, of nucleotides which can form base pairs between two 
polynucleotide strands or within a specific region or domain of the two strands. 
Generally, complementary nucleotides are, according to the base pairing rules, 

10 adenine and thymine (or adenine and uracil), and cytosine and guanine. 

Complementarity may be partial, in which only some of the nucleic acids 1 bases are 
matched according to the base pairing rules. Or, there may be a complete or total 
complementarity between the nucleic acids. The degree of complementarity 
between nucleic acid strands has effects on the efficiency and strength of 

1 5 hybridization between nucleic acid strands. 

Two nucleic acid strands are considered to be 100% complementary to each other 
over a defined length if in a defined region all adenines of a first strand can pair 
with a thymine (or an uracil) of a second strand, all guanines of a first strand can 

20 pair with a cytosine of a second strand, all thymine (or uracils) of a first strand can 
pair with an adenine of a second strand, and all cytosines of a first strand can pair 
with a guanine of a second strand, and vice versa. According to the present 
invention, the degree of complementarity is determined over a stretch of 20, 
preferably 25, nucleotides, i.e. a 60% complementarity means that within a region 

25 of 20 nucleotides of two nucleic acid strands 12 nucleotides of the first strand can 
base pair with 12 nucleotides of the second strand according to the above ruling, 
either as a stretch of 12 contiguous nucleotides or interspersed by non-pairing 
nucleotides, when the two strands are attached to each other over said region of 20 
nucleotides. The degree of complementarity can range from at least about 50% to 

30 full, i.e. 100% complementarity. Two single nucleic acid strands are said to be 
"substantially complementary" when they are at least about 80% complementary, 
preferably about 90% or higher. For carrying out the method of the present 
invention substantial complementarity is preferred. 

35 Preferred methods for detection and quantification of the amount of 
polynucleotides, i.e. for the methods according to the invention allowing the 
determination of the level of expression of a marker, are those described by 
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Sambrook et al. (1989) or real time methods known in the art as the TaqMan® 
method disclosed in WO92/02638 and the corresponding U.S. 5,210,015, U.S. 
5,804,375, U.S. 5,487,972. This method exploits the exonuclease activity of a 
polymerase to generate a signal. In detail, the (at least one) target nucleic acid 
5 component is detected by a process comprising contacting the sample with an 
oligonucleotide containing a sequence complementary to a region of the target 
nucleic acid component and a labeled oligonucleotide containing a sequence 
complementary to a second region of the same target nucleic acid component 
sequence strand, but not including the nucleic acid sequence defined by the first 

10 oligonucleotide, to create a mixture of duplexes during hybridization conditions, 
wherein the duplexes comprise the target nucleic acid annealed to the first 
oligonucleotide and to the labeled oligonucleotide such that the 3 '-end of the first 
oligonucleotide is adjacent to the 5 '-end of the labeled oligonucleotide. Then this 
mixture is treated with a template-dependent nucleic acid polymerase having a 5' 

15 to 3' nuclease activity under conditions sufficient to permit the 5' to 3' nuclease 
activity of the polymerase to cleave the annealed, labeled oligonucleotide and 
release labeled fragments. The signal generated by the hydrolysis of the labeled 
oligonucleotide is detected and/ or measured. TaqMan® technology eliminates the 
need for a solid phase bound reaction complex to be formed and made detectable. 

20 Other methods include e.g. fluorescence resonance energy transfer between two 
adjacently hybridized probes as used in the LightCycler® format described in U.S. 
6,174,670. 

A preferred protocol if the marker, i.e. the polynucleotide, is in form of a 
25 transcribed nucleotide, is described in Example 3, where total RNA is isolated, 
cDNA and, subsequently, cRNA is synthesized and biotin is incorporated during 
the transcription reaction. The purified cRNA is applied to commercially available 
arrays which can be obtained e.g. from Affymetrix. The hybridized cRNA is 
detected according to the methods described in Example 3. The arrays are produced 
30 by photolithography or other methods known to experts skilled in the art e.g. from 
U.S. 5,445,934, U.S. 5,744,305, U.S. 5,700,637, U.S. 5,945,334 and EP 0 619 321 
or EP 0 373 203, or as decribed hereinafter in greater detail. 

In another embodiment of the present invention, the polynucleotide or at least one 
35 of the polynucleotides is in form of a polypeptide. In another preferred 
embodiment, the expression level of the polynucleotides or polypeptides is detected 
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using a compound which specifically binds to the polynucleotide of the polypeptide 
of the present invention. 

As used herein, "specifically binding" means that the compound is capable of 
5 discriminating between two or more polynucleotides or polypeptides, i.e. it binds to 
the desired polynucleotide or polypeptide, but essentially does not bind 
unspecifically to a different polynucleotide or polypeptide. 

The compound can be an antibody, or a fragment thereof, an enzyme, a so-called 
10 small molecule compound, a protein-scaffold, preferably an anticalin. In a 
preferred embodiment, the compound specifically binding to the polynucleotide or 
polypeptide is an antibody, or a fragment thereof. 

As used herein, an "antibody" comprises monoclonal antibodies as first described 
15 by Kohler and Milstein in Nature 278 (1975), 495-497 as well as polyclonal 
antibodies, i.e. antibodies contained in a polyclonal antiserum. Monoclonal 
antibodies include those produced by transgenic mice. Fragments of antibodies 
include F(ab') 2 , Fab and Fv fragments. Derivatives of antibodies include scFvs, 
chimeric and humanized antibodies. See, for example Harlow and Lane, loc. cit. 
20 For the detection of polypeptides using antibodies or fragments thereof, the person 
skilled in the art is aware of a variety of methods, all of which are included in the 
present invention. Examples include immunoprecipitation, Western blotting, 
Enzyme-linked immuno sorbent assay (ELISA), Enzyme-linked immuno sorbent 
assay (RIA), dissociation-enhanced lanthanide fluoro immuno assay (DELFIA), 
25 scintillation proximity assay (SPA). For detection, it is desirable if the antibody is 
labelled by one of the labelling compounds and methods described supra. 

In another preferred embodiment of the present invention, the method for 
distinguishing AML-specific FLT3 length mutations from TKD mutations is 
30 carried out on an array. 

In general, an "array 35 or "microarray" refers to a linear or two- or three 
dimensional arrangement of preferably discrete nucleic acid or polypeptide probes 
which comprises an intentionally created collection of nucleic acid or polypeptide 
35 probes of any length spotted onto a substrate/solid support The person skilled in 
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the art knows a collection of nucleic acids or polypeptide spotted onto a 
substrate/solid support also under the term "array". As known to the person skilled 
in the art, a microarray usually refers to a miniaturised array arrangement, with the 
probes being attached to a density of at least about 10, 20, 50, 100 nucleic acid 
5 molecules referring to different or the same genes per cm 2 . Furthermore, where 
appropriate an array can be referred to as "gene chip". The array itself can have 
different formats, e.g. libraries of soluble probes or libraries of probes tethered to 
resin beads, silica chips, or other solid supports. 

1 0 The process of array fabrication is well-known to the person skilled in the art. In 
the following, the process for preparing a nucleic acid array is described. 
Commonly, the process comprises preparing a glass (or other) slide (e.g. chemical 
treatment of the glass to enhance binding of the nucleic acid probes to the glass 
surface), obtaining DNA sequences representing genes of a genome of interest, and 

15 spotting sequences these sequences of interest onto glass slide. Sequences of 
interest can be obtained via creating a cDNA library from an mRNA source or by 
using publicly available databases, such as GeneBank, to annotate the sequence 
information of custom cDNA libraries or to identify cDNA clones from previously 
prepared libraries. Generally, it is recommendable to amplify obtained sequences 

20 by PCR in order to have sufficient amounts of DNA to print on the array. The 
liquid containing the amplified probes can be deposited on the array by using a set 
of microspotting pins. Ideally, the amount deposited should be uniform. The 
process can further include UV-crosslinking in order to enhance immobilization of 
the probes on the array. 

25 

In a preferred embodiment, the array is a high density oligonucleotide (oligo) array 
using a light-directed chemical synthesis process, employing the so-called 
photolithography technology. Unlike common cDNA arrays, oligo arrays 
(according to the Affymetrix technology) use a single-dye technology. Given the 

30 sequence information of the markers, the sequence can be synthesized directly onto 
the array, thus, bypassing the need for physical intermediates, such as PCR 
products, required for making cDNA arrays. For this purpose, the marker, or partial 
sequences thereof, can be represented by 14 to 20 features, preferably by less than 
14 features, more preferably less than 10 features, even more preferably by 6 

35 features or less, with each feature being a short sequence of nucleotides 
(oligonucleotide), which is a perfect match (PM) to a segment of the respective 
gene. The PM oligonucleotide are paired with mismatch (MM) oligonucleotides 
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which have a single mismatch at the central base of the nucleotide and are used as 
"controls". The chip exposure sites are defined by masks and are deprotected by 
the use of light, followed by a chemical coupling step resulting in the synthesis of 
one nucleotide. The masking, light deprotection, and coupling process can then be 
5 repeated to synthesize the next nucleotide, until the nucleotide chain is of the 
specified length. 

Advantageously, the method of the present invention is carried out in a robotics 
system including robotic plating and a robotic liquid transfer system, e.g. using 
10 microfluidics, i.e. channelled structured. 

A particular preferred method according to the present invention is as follows: 
1. Obtaining a sample, e.g. bone marrow or peripheral blood aliquots, from a 
patient having AML 
15 2. Extracting RNA, preferably mRNA, from the sample 

3. Reverse transcribing the RNA into cDNA 

4. In vitro transcribing the cDNA into cRNA 

5. Fragmenting the cRNA 

6. Hybridizing the fragmented cRNA on standard microarrays 
20 7. Determining hybridization 

In another embodiment, the present invention is directed to the use of at least one 
marker selected from the markers identifiable by their Affymetrix Identification 
Numbers (affy ID) as defined in Tables 1, and/or 2 for the manufacturing of a 

25 diagnostic for distinguishing AML-specific FLT3 length mutations from TKD 
mutations. The use of the present invention is particularly advantageous for 
distinguishing AML-specific FLT3 length mutations from TKD mutations in an 
individual having AML. The use of said markers for diagnosis of AML-specific 
FLT3 length mutations from TKD mutations, preferably based on microarray 

30 technology, offers the following advantages: (1) more rapid and more precise 
diagnosis, (2) easy to use in laboratories without specialized experience, (3) 
abolishes the requirement for analyzing viable cells for chromosome analysis 
(transport problem), and (4) very experienced hematologists for cytomorphology 
and cytochemistry, immunophenotyping as well as cytogeneticists and 

35 molecularbiologists are no longer required. 
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Accordingly, the present invention refers to a diagnostic kit containing at least one 
marker selected from the markers identifiable by their Affymetrix Identification 
Numbers (affy ID) as defined in Tables 1, and/or 2 for distinguishing AML- 
specific FLT3 length mutations from TKD mutations, in combination with suitable 
5 auxiliaries. Suitable auxiliaries, as used herein, include buffers, enzymes, labelling 
compounds, and the like. In a preferred embodiment, the marker contained in the 
kit is a nucleic acid molecule which is capable of hybridizing to the mRNA 
corresponding to at least one marker of the present invention. Preferably, the at 
least one nucleic acid molecule is attached to a solid support, e.g. a polystyrene 
10 microliter dish, nitrocellulose membrane, glass surface or to non-immobilized 
particles in solution. 

In another preferred embodiment, the diagnostic kit contains at least one reference 
for an AML-specific FLT3 length mutation and/or TKD mutation. As used herein, 
1 5 the reference can be a sample or a data bank. 

In another embodiment, the present invention is directed to an apparatus for 
distinguishing AML-specific FLT3 length mutations from TKD mutations in a 
sample, containing a reference data bank obtainable by comprising 
20 (a) compiling a gene expression profile of a patient sample by determining 

the expression level at least one marker selected from the markers 
identifiable by their Affymetrix Identification Numbers (affy ID) as 
defined in Tables 1 , and/or 2, and 
(b) classifying the gene expression profile by means of a machine learning 
25 algorithm. 

According to the present invention, the "machine learning algorithm" is a 
computational-based prediction methodology, also known to the person skilled in 
the art as "classifier", employed for characterizing a gene expression profile. The 

30 signals corresponding to a certain expression level which are obtained by the 
microarray hybridization are subjected to the algorithm in order to classify the 
expression profile. Supervised learning involves "training" a classifier to recognize 
the distinctions among classes and then "testing" the accuracy of the classifier on 
an independent test set For new, unknown samples the classifier shall predict into 

35 which class the sample belongs. 
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Preferably, the machine learning algorithm is selected from the group consisting of 
Weighted Voting, K-Nearest Neighbors, Decision Tree Induction, Support Vector 
Machines (SVM), and Feed-Forward Neural Networks. Most preferably, the 
machine learning algorithm is Support Vector Machine, such as polynomial kernel 
5 and Gaussian Radial Basis Function-kernel SVM models. 

The classification accuracy of a given gene list for a set of microarray experiments 
is preferably estimated using Support Vector Machines (SVM), because there is 
evidence that SVM-based prediction slightly outperforms other classification 

10 techniques like k-Nearest Neighbors (k-NN). The LIB SVM software package 
version 236 was used (SVM-type: C-SVC, linear kernel 
(http://wwwxsie.ntu.edu.tw^^ The skilled artisan is furthermore 

referred to Brown et al., Proc.NatLAcad.Sci., 2000; 97: 262-267, Furey et al., 
Bioinformatics. 2000; 16: 906-914, and Vapnik V. Statistical Learning Theory. 

1 5 New York: Wiley, 1998. 

In detail, the classification accuracy of a given gene list for a set of microarray 
experiments can be estimated using Support Vector Machines (SVM) as supervised 
learning technique. Generally, SVMs are trained using differentially expressed 

20 genes which were identified on a subset of the data and then this trained model is 
employed to assign new samples to those trained groups from a second and 
different data set. Differentially expressed genes were identified applying ANOVA 
and t-test-statistics (Welch t-test). Based on identified distinct gene expression 
signatures respective training sets consisting of 2/3 of cases and test sets with 1/3 

25 of cases to assess classification accuracies are designated. Assignment of cases to 
training and test set is randomized and balanced by diagnosis. Based on the training 
set a Support Vector Machine (SVM) model is built. 

According to the present invention, the apparent accuracy, i.e. the overall rate of 
30 correct predictions of the complete data set was estimated by lOfold cross 
validation. This means that the data set was divided into 10 approximately equally 
sized subsets, an SVM-model was trained for 9 subsets and predictions were 
generated for the remaining subset. This training and prediction process was 
repeated 10 times to include predictions for each subset. Subsequently the data set 
35 was split into a training set, consisting of two thirds of the samples, and a test set 
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with the remaining one third Apparent accuracy for the training set was estimated 
by lOfold cross validation (analogous to apparent accuracy for complete set). A 
SVM-model of the training set was built to predict diagnosis in the independent test 
set, thereby estimating true accuracy of the prediction model. This prediction 
5 approach was applied both for overall classification (multi-class) and binary 
classification (diagnosis X => yes or no). For the latter, sensitivity and specificity 
were calculated: 

Sensitivity = (number of positive samples predicted)/(number of true positives) 
Specificity = (number of negative samples predicted)/(number of true negatives) 

10 

In a preferred embodiment, the reference data bank is backed up on a 
computational data memory chip which can be inserted in as well as removed from 
the apparatus of the present invention, e.g. like an interchangeable module, in order 
to use another data memory chip containing a different reference data bank. 

15 

The apparatus of the present invention containing a desired reference data bank can 
be used in a way such that an unknown sample is, first, subjected to gene 
expression profiling, e.g. by microarray analysis in a manner as described supra or 
in the art, and the expression level data obtained by the analysis are, second, fed 

20 into the apparatus and compared with the data of the reference data bank obtainable 
by the above method. For this purpose, the apparatus suitably contains a device for 
entering the expression level of the data, for example a control panel such as a 
keyboard. The results, whether and how the data of the unknown sample fit into the 
reference data bank can be made visible on a provided monitor or display screen 

25 and, if desired, printed out on an incorporated of connected printer. 

Alternatively, the apparatus of the present invention is equipped with particular 
appliances suitable for detecting and measuring the expression profile data and, 
subsequently, proceeding with the comparison with the reference data bank. In this 
30 embodiment, the apparatus of the present invention can contain a gripper arm 
and/or a tray which takes up the microarray containing the hybridized nucleic 
acids. 

In another embodiment, the present invention refers to a reference data bank for 
35 distinguishing AML-specific FLT3 length mutations from TKD mutations in a 
sample obtainable by comprising 
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(a) compiling a gene expression profile of a patient sample by determining 
the expression level of at least one marker selected from the markers 
identifiable by their Asymetrix Identification Numbers (afiy ID) as 
defined in Tables 1, and/or 2, and 
5 (b) classifying the gene expression profile by means of a machine learning 

algorithm. 

Preferably, the reference data bank is backed up and/or contained in a 
computational memory data chip. 

10 

The invention is further illustrated in the following table and examples, without 
limiting the scope of the invention: 

TABLES 1.1-2.28 

15 Tables 1-2 show AML subtype analysis of AML-specific FLT3 length mutations 
from TKD mutations. The analysed markers are ordered according to their q- 
values, beginning with the lowest q-values. 

For convenience and a better understanding, Tables 1.1 to 2.28 are accompanied 
with explanatory tables (Table 1.1 A to 2.28 A) where the numbering and the 
20 Affymetrix Id are further defined by other parameters, e.g. gene bank accession 
number. 

EXAMPLES 

25 Example 1: General experimental design of the invention and results 

Different subtypes of acute myeloid leukemia (AML) can clearly be distinguished 
by morphology, cytogenetics, and molecular genetics. Mutations within the FLT3- 
gene are the most common genetic alterations in AML. Length mutations in the 

30 juxtamembrane domain of FLT3 (FLT3-LM) were detected in 460/2134 (21.6%) 
and mutations in the tyrosine kinase domain (TKD-mutations) in 98/1711 (5.7%) 
of newly diagnosed AML in our series. Both kinds of FLT3 mutations can be found 
most frequently in AML with normal karyotype (AML-NK). While for TKD 
mutations a prognostic significance has not yet been shown, the FLT3-LM defines 

35 a prognostically unfavorable subset of AML-NK. Some of these pts have loss of 
the wildtype (WT) FLT3 allele in addition to the FLT3-LM. These pts were shown 
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to have an even worse outcome than FLT3-LM+ pts that retain the WT-allele. Here 
we addressed the questions 1) whether pts with FLT3 mutations can be identified 
from those without FLT3 mutations within the AML-NK and 2) whether different 
types of FLT3 mutations can be identified by distinct gene expression signatures. 
5 Therefore, 148 cases with AML-NK were analyzed by U133 set microarrays 
(Affymetrix). For each patient the FLT3-LM status was assessed by GeneScan 
analysis (Applied Biosystems) and the TKD mutation status by melting curve 
analysis and sequencing. The total cohort was subdivided into 8 groups 1) AML 
NK and no FLT3 mutation (n=63), 2) status 1: FLT3-LM/WT ratio <0.3 (n=12), 3) 

10 status 2: ratio 0.7-1.1 (n=19), 4) status 3: ratio of >1.2=partial loss of WT (n=30), 
5) status 4: total loss of WT (n=5), 6) status 5: two or more low status mutations 
(n=5), 7) TKD mutation (n=10), 8) TKD + LM (n=3).Microarray data was 
analyzed by pattern recognition algorithms (Principal Component Analysis (PCA) 
and hierarchical clustering), as well as Support Vector Machines (SVM) for 

15 estimation of classification accuracies. Therefore, all samples were divided into a 
training set consisting of 2/3 of cases to built a SVM model and a test set with 
remaining 1/3 of cases. Differentially expressed genes were selected according to 
ANOVA and t-test-statistics in the training set. A specific expression pattern was 
assessed for each of the defined subgroups. Using pairwise comparisons, the TKD 

20 mutations can clearly be distingushed from the FLT3-LM. In addition, FLT3-LM 
with loss of WT reveal a specific expression pattern in comparison to low status 
FLT3-LM. By use of SVM comparisons to AML-NK and all other mutation classes 
an accuracy of 100% was found for status 4, 78% for status 3 (sensitivity 64%, 
specificity 84%), 74% for status 2 (sensitivity 17%, specificity 88%), 75% for 

25 status 1 (sensitivity 50%, specificity 79%), 88% for TKD mutations (sensitivity 
50%, specificity 95%), but only 8% for status 5 mutations. In conclusion a high 
percentage of cases of the different FLT3 -mutations can be exactly assigned. Only 
the status 5 mutations can not be defined by a specific expression profile. Besides 
their clinical differences we could show that pts with FLT3-LM with different 

30 quantitative status of the LM as well as the TKD mutations are also different with 
regard to their expression pattern. This supports the hypothesis i) that FLT3-LM 
and TKD mutations act through different downstream target genes and play 
different functional roles in leukemogenesis; ii) that FLT3-LM cases with loss of 
the WT-allele should be regarded differently, i.e. with regard to prognosis and 

35 therapeutical interventions. 
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Example 2: General materials, methods and definitions of functional 
annotations 

5 The methods section contains both information on statistical analyses used for 
identification of differentially expressed genes and detailed annotation data of 
identified microarray probesets. 

Affvmetrix Probeset Annotation 

10 All annotation data of GeneChip® arrays are extracted from the NetAfBc™ 
Analysis Center (internet website: www.affymetrix.com). Files for U133 set arrays, 
including U133A and U133B microarrays are derived from the June 2003 release. 
The original publication refers to: Liu G, Loraine AE, Shigeta R, Cline M, Cheng J, 
Valmeekam V, Sun S, Rulp D, Siani-Rose MA. NetAffx: Asymetrix probesets and 

15 annotations. Nucleic Acids Res. 2003;31(l):82-6. 

The sequence data are omitted due to their large size, and because they do not 
change, whereas the annotation data are updated periodically, for example new 
information on chromomal location and functional annotation of the respective 
20 gene products. Sequence data are available for download in the NetAffx Download 
Center (www.affymetrix.com) 

Data fields: 

In the following section, the content of each field of the data files are described. 
25 Microarray probesets, for example found to be differentially expressed between 
different types of leukemia samples are further described by additional information. 
The fields are of the following types: 

1 . GeneChip Array Information 
30 2. Probe Design Information 

3. Public Domain and Genomic References 

1 . GeneChip Array Information 

35 HG-U133 ProbeSetJD: 

HG-U133 ProbeSet_ID describes the probe set identifier. Examples are: 

200007 at, 20001 l_s_at, 200012_x_at 
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GeneChip: 

The description of the GeneChip probe array name where the respective probeset is 
represented Examples are: Affymetrix Human Genome U133A Array or 
5 Affymetrix Human Genome Ul 33B Array. 

2. Probe Design Information 

Sequence Type: 

1 0 The Sequence Type indicates whether the sequence is an Exemplar, Consensus or 
Control sequence. An Exemplar is a single nucleotide sequence taken directly from 
a public database. This sequence could be an mRNA or EST. A Consensus 
sequence, is a nucleotide sequence assembled by Affymetrix, based on one or more 
sequence taken from a public database. 

15 

Transcript ID: 

The cluster identification number with a sub-cluster identifier appended. 

■ 

Sequence Derived From: 
20 The accession number of the single sequence, or representative sequence on which 
the probe set is based. Refer to the "Sequence Source" field to determine the 
database used. 

Sequence ID: 

25 For Exemplar sequences: Public accession number or GenBank identifier. For 
Consensus sequences: Affymetrix identification number or public accession 
number. 

Sequence Source: 

30 The database from which the sequence used to design this probe set was taken. 

Examples are: GenBank®, RefSeq, UniGene, TIGR (annotations from The 
Institute for Genomic Research). 
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3. Public Domain and Genomic References 

Most of the data in this section come from LocusLink and UniGene databases, and 
are annotations of the reference sequence on which the probe set is modeled. 

5 

Gene Symbol and Title: 

A gene symbol and a short title, when one is available. Such symbols are assigned 
by different organizations for different species. Affymetrix annotational data come 
from the UniGene record. There is no indication which species-specific databank 
10 was used, but some of the possibilities include for example HUGO: The Human 
Genome Organization. 

MapLocation: 

The map location describes the chromosomal location when one is available. 

15 

UnigeneAccession: 

UniGene accession number and cluster type. Cluster type can be "full length" or 
"est", or " — " if unknown. 

20 LocusLink: 

4 

This information represents the LocusLink accession number. 
Full Length Ref. Sequences: 

Indicates the references to multiple sequences in RefSeq. The field contains the ID 
25 and description for each entry, and there can be multiple entries per probeSet. 

Example 3: Sample preparation, processing and data analysis 

Method 1 : 

30 Microarray analyses were performed utilizing the GeneChip® System (Affymetrix, 
Santa Clara, USA). Hybridization target preparations were performed according to 
recommended protocols (Affymetrix Technical Manual). In detail, at time of 
diagnosis, mononuclear cells were purified by Ficoll-Hypaque density 
centrifugation. They had been lysed immediately in RLT buffer (Qiagen, Hilden, 

35 Germany), frozen, and stored at -80°C from 1 week to 38 months. For gene 
expression profiling cell lysates of the leukemia samples were thawed, 



WO 2005/043168 



PCT7EP2004/012470 



-32- 



homogenized (QIAshredder, Qiagen), and total RNA was extracted (RNeasy Mini 
Kit, Qiagen). Subsequently, 5-10 pg total RNA isolated from 1 x 10 7 cells was 
used as starting material for cDNA synthesis with oligo[(dT)24T7promotor]65 
primer (cDNA Synthesis System, Roche Applied Science, Mannheim, Germany). 
5 cDNA products were purified by phenol/cMorophorm/IAA extraction (Ambion, 
Austin, USA) and acetate/ethanol-precipitated overnight. For detection of the 
hybridized target nucleic acid biotin-labeled ribonucleotides were incorporated 
during the following in vitro transcription reaction (Enzo BioArray HighYield 
RNA Transcript Labeling Kit, Enzo Diagnostics). After quantification by 
10 spectrophotometric measurements and 260/280 absorbance values assessment for 
quality control of the purified cRNA (RNeasy Mini Kit, Qiagen), 15 jig cRNA was 
fragmented by alkaline treatment (200 mM Tris-acetate, pH 8.2/500 mM potassium 
acetate/150 mM magnesium acetate) and added to the hybridization cocktail 
sufficient for five hybridizations on standard GeneChip microarrays (300 pi final 
1 5 volume). Washing and staining of the probe arrays was performed according to the 
recommended Fluidics Station protocol (EukGE-WS2v4). Afiymetrix Microarray 
Suite software (version 5.0.1) extracted fluorescence signal intensities from each 
feature on the microarrays as detected by confocal laser scanning according to the 
manufacturers recommendations. 

Expression analysis quality assessment parameters included visiual array 
inspection of the scanned image for the presence of image artifacts and correct grid 
alignment for the identification of distinct probe cells as well as both low 375' 
ratio of housekeeping controls (mean: 1.90 for GAPDH) and high percentage of 
detection calls (mean: 46.3% present called genes). The 3' to 5' ratio of GAPDH 
probesets can be used to assess RNA sample and assay quality. Signal values of the 
3 9 probe sets for GAPDH are compared to the Signal values of the corresponding 
5' probe set. The ratio of the 3 9 probe set to the 5' probe set is generally no more 
than 3.0. A high 3' to 5' ratio may indicate degraded RNA or inefficient synthesis 
of ds cDNA or biotinylated cRNA (GeneChip® Expression Analysis Technical 
Manual, www.affymetrix.com). Detection calls are used to determine whether the 
transcript of a gene is detected (present) or undetected (absent) and were calculated 
using default parameters of the Microarray Analysis Suite MAS 5.0 software 
package. 



20 



25 



30 



35 
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Method 2: 

Bone marrow (BM) aspirates are taken at the time of the initial diagnostic biopsy 
and remaining material is immediately lysed in RLT buffer (Qiagen), frozen and 
stored at -80 C until preparation for gene expression analysis. For microarray 
analysis the GeneChip System (Affymetrix, Santa Clara, CA, USA) is used. The 
targets for GeneChip analysis are prepared according to the current Expression 
Analysis. Briefly, frozen lysates of the leukemia samples are thawed, homogenized 
(QIAshredder, Qiagen) and total RNA extracted (RNeasy Mini Kit, 
Qiagen).Normally 10 ug total RNA isolated from 1 x 107 cells is used as starting 
material in the subsequent cDNA-Synthesis using 01igo-dT-T7-Promotor Primer 
(cDNA synthesis Kit, Roche Molecular Biochemicals). The cDNA is purified by 
phenol-chlorophorm extraction and precipitated with 100% Ethanol over night. For 
detection of the hybridized target nucleic acid biotin-labeled ribonucleotides are 
incorporated during the in vitro transcription reaction (Enzo® BioArray™ 
HighYield™ RNA Transcript Labeling Kit, ENZO). After quantification of the 
purified cRNA (RNeasy Mini Kit, Qiagen), 15 ug are fragmented by alkaline 
treatment (200 mM Tris-acetate, pH 8.2, 500 mM potassium acetate, 150 mM 
magnesium acetate) and added to the hybridization cocktail sufficient for 5 
hybridizations on standard GeneChip microarrays. Before expression profiling 
Test3 Probe Arrays (Affymetrix) are chosen for monitoring of the integrity of the 
cRNA. Only labeled cRNA-cocktails which showed a ratio of the messured 
intensity of the 3' to the 5' end of the GAPDH gene less than 3.0 are selected for 
subsequent hybridization on HG-U133 probe arrays (Affymetrix). Washing and 
staining the Probe arrays is performed as described (siehe Affymetrix-Original- 
Literatur (LOCKHART und LIPSHUTZ). The Affymetrix software (Microarray 
Suite, Version 4.0.1) extracted fluorescence intensities from each element on the 
arrays as detected by confocal laser scanning according to the manufacturers 
recommendations. 
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3.36E-02 


1.71 


6.47 18q21 



2.9 Double versus Status 2 



# affy id HUGO name fc 

1 226245_at 1 .61 

2 221728_x_at 4.65 

3 237548_at 2.07 

4 213911_s_at H2AFZ 1.39 

5 220623_s_at TSGA10 1.88 

6 208700_s_at TKT 1 .71 

7 203772_at BLVRA 3.32 

8 204083_s_at TPM2 -11.16 

9 241655_at 2.16 

10 214037_s_at JM1 175 

11 213827_at SNX26 1.83 

12 201464 x at JUN 169 



13 36552 at DKFZP586P0123 1.43 



q stn t Map 

Location 



1 .26E-07 


5.22E-03 


1.87 


8.69 


4.96E-04 


1 .65E-01 


2.03 


8.18 


4.43E-04 


1 .55E-01 


1.95 


7.96 


9.73E-07 


1.08E-02 


1.69 


7.81 4q24 


3.24E-07 


5.22E-03 


1.63 


7.56 2q11.2 


3.52E-07 


5.22E-03 


1.60 


7.45 3p14.3 


3.06E-04 


1.27E-01 


1.67 


7.07 7p14-cen 


1.21 E-06 


1 .08E-02 


-1.51 


-6.95 9p13.2- 








p13.1 


1 .85E-06 


1.17E-02 


1.48 


6.93 


1 .51 E-06 


1.12E-02 


1.46 


6.79 Xp11.23 


1 .28E-05 


3.57E-02 


1.48 


6.77 19q13.12 


2.85E-06 


1 .59E-02 


1.41 


6.59 1p32-p31 


1 .51 E-03 


2.42 E-01 


1.60 


6.43 11q13.3 



WO 2005/043168 

14 210118_S_at 

15 219392_x_at 

16 214092_x_at 

17 200595_s_at 

18 201382_at 

19 200030_s_at- 
HG-U133A 

20 206841 _at 

21 239246_at 

22 205962_at 

23 204332_s_at 

24 229697_at 

25 215587_x_at 

26 244125_at 

27 214630_at 

28 208064_s_at 

29 243613_at 

30 242573_at 

31 241536_at 

32 211987_at 

33 234809_at 

34 AFFX-BioDn-5. 

35 208891 _at 

36 232322_x_at 

37 208892_s_at 

38 222984_at 

39 221952_x_at 

40 20661 4_at 

41 20071 5_x_at 

42 215262_at 

43 225248_at 

44 231204_at 

45 214826_at 

46 232075_at 

47 211747_s_at 

48 213975_s_at 

49 203597_s_at 

50 209981 at 



IL1A 

FLJ11029 
SFRS14 
EIF3S10 
SIP 

SLC25A3 
PDE6H 

PAK2 
AGA 



CYP1 1 B2 

SIAT8C 

MGC24039 



TOP2B 
HCA127 
at-HG-U133A 
DUSP6 
STARD10 
DUSP6 
PAIP2 
KIAA1393 
GDF5 
RPL13A 

SPPL2B 
FLJ11331 
DKFZp667B1218 
REC14 
LSM5 
LY2 
WBP4 
PIPPIN 



2.48 
1.71 
1.41 
1.38 
1.78 
-1.22 

3.84 
1.95 
4.02 
1.83 
1.51 
2.99 

-5.79 
2.06 

-5.23 
1.77 
1.79 
1.84 
1.57 
3.71 
1.35 
1.96 

-4.95 
2.12 

-1.32 
1.39 

-2.83 

-1.31 
1.97 

-3.23 

-7.44 
2.02 
1.62 
1.55 
1.52 

-1.56 
2.17 
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9.02E-05 
3.68E-04 
3.94E-06 
3.85E-06 
3.78E-06 
6.71 E-05 

9.83E-03 
3.70E-05 
2.13E-02 
7.64E-04 
2.44E-04 
8.49E-03 
9.34E-06 
6.01 E-04 
1.27E-05 
1.22E-05 
2.34E-04 
1.21 E-05 
1.74E-03 
1.41 E-02 
7.88E-04 
3.58E-05 
5.81 E-05 
1.83E^03 
2.54E-04 
2.84E-05 
7.00E-05 
6.18E-03 
3.62E-05 
3.17E-05 
2.57E-05 
2.41 E-03 
3.08E-05 
3.94E-04 
4.82E-05 
3.55E-05 
1.75E-04 



7.10E-02 
1.38E-01 
1 .60E-02 
1.60E-02 
1.60E-02 
7.10E-02 

4.73E-01 
6.60E-02 
6.01 E-01 
1.86E-01 
1.18E-01 
4.52E-01 
3.46E-02 
1 .73E-01 
3.57E-02 
3.57E-02 
1.16E-01 
3.57E-02 
2.62E-01 
5.27E-01 
1 .86E-01 
6.60E-02 
7.10E-02 
2.68E-01 
1.18E-01 
6.60E-02 
7.10E-02 
4.08E-01 
6.60E-02 
6.60E-02 
6.60E-02 
2.82E-01 
6.60E-02 
1.43E-01 
7.10E-02 
6.60E-02 
1.06E-01 
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1.44 6.43 2q14 

1.50 6.43 17q23.2 

1.39 6.4019p12 

1.35 6.31 10q26 

1.34 6.30 1q24-q25 
-1.39 -6.25 12q23 

1.87 6.24 12p13 

1.36 6.19 

2.35 6.07 3q29 

1 .42 6.01 4q32-q33 

1 .37 6.01 
1 .72 6.00 

-1 .30 -5.98 

1.36 5.86 8q21-q22 
-1.26 -5.82 18q21.2 

1.24 5.78 12p1 1.21 

1 .30 5.77 

1 .23 5.77 

1 .40 5.76 3p24 
1.78 5.75 Xq1 1.1 
1 .34 5.72 

1.23 5.72 12q22-q23 

-1.23 -5.66 11q13 

1.37 5.66 12q22-q23 
-1 .27 -5.66 5q31 .3 

1.20 5.59 14q23.1 

-1.21 -5.56 20q1 1.2 

-1.48 -5.55 19q1 3.3 

1.19 5.55 

-1.21 -5.46 19p13.3 

-1.17 -5.46 4q25 

1 .33 5.43 3p21 .2 

1.16 5.43 15q24.1 

1.22 5.41 7p14.3 

1.16 5.41 12q14.3 

-1.15 -5.39 13q1 3.3 

1.19 5.38 22q13.2- 

q13.31 



2.10 Double versus Status 3 



affy id 

1 236737_at 

2 220623_s_at 

3 205282_at 

4 225385 s at 



HUGO name 

FLJ31528 
TSGA10 
LRP8 
LOC92906 



fc 



2.77 
2.10 
1.73 
2.61 



7.41 E-1 3 
2.98E-12 
1 .64E-1 1 
1.50E-07 



2.90E-08 
5.83E-08 
2.14E-07 
1.59E-04 



stn t Map 

Location 
2.23 12.26 17q25.3 

1.99 11.27 2q1 1.2 

1.96 11.04 1p34 

2.06 10.96 2p22.2 



WO 2005/043168 

5 201382_at 

6 222673_x_at 

7 231731_at 

8 203983_at 

9 218262_at 

10 241536_at 

1 1 243280_at 

12 225683_x_at 

13 239246_at 

14 222779_s_at 

15 214038_at 

16 242053_at 

17 230438_at 

18 214092_x_at 

19 243938_x_at 

20 237852_at 

21 228322_at 

22 239405_at 

23 213911_s_at 

24 232856_at 

25 220461_at 

26 226944_at 

27 232075_at 

28 233501 _at 

29 24063 1_at 

30 238751 _at 

31 235766_x_at 

32 234938_at 

33 242573_at 

34 206245_s_at 

35 23436 1_at 

36 236396_at 

37 221952_x_at 

38 202462_s_at 

39 233823_at 

40 241655_at 

41 214037_s_at 

42 240354_at 

43 207287_at 

44 205083_at 

45 229812_at 

46 205424_at 

47 236240_at 

48 211747_s_at 

49 234682_at 

50 218315 s at 
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SIP 

LOC1 59090 
OTX2 
TSNAX 
FLJ22318 



PHP14 

HSA277841 
CCL8 

TBX15 

SFRS14 

DNAH5 

C14orf35 

H2AFZ 

FLJ11383 

HTRA3 

REC14 

MASS1 



BACH1 



NS1-BP 



KIAA1393 
KIAA0801 
KIAA1276 

JM1 

MGC35033 

FU14107 

AOX1 

FLJ23277 

ProSAPiP2 

FLJ21106 

LSM5 

CDK5RAP1 



2.01 

1.73 

1.93 

1.86 

1.64 

1.94 

1.76 

2.53 

1.97 

1.54 

2.14 

1.64 

1.57 

1.41 

2.85 

1.81 

2.41 

1.71 

1.45 

1.87 

1.72 

2.29 

1.85 

1.61 

1.56 

2.29 

1.96 

2.02 

2.03 

1.79 

2.20 
3.48 
1.42 
1.51 
2.42 
1.91 
1.61 
1.70 
1.72 
1.98 
1.52 
-4.45 
1.74 
1.67 
1.63 
1.33 



7.99E-10 
2.61 E-08 
2.68E-1 1 
2.27E-07 
1 .00E-09 
1 .30E-08 
3.00E-10 
3.64E-04 
7.95E-04 
6.58E-07 
5.93E-09 
3.49E-07 
5.55E-10 
4.78E-10 
1 .51 E-04 
6.01 E-06 
9.71 E-10 
5.41 E-09 
6.43E-08 
1.17E-05 
2.91 E-06 
1 .24E-04 
1.41E-07 
8.20E-09 
9.05E-09 
1 .42E-08 
8.58E-09 
1 .57E-03 
4.71 E-04 
1 .25E-05 

1 .30E-04 
3.47E-03 
5.57E-06 
1 .95E-08 
4.54E-06 
1 .63E-06 
1 .09E-08 
1 .72E-08 
1 .78E-05 
3.50E-06 
1 .25E-07 
2.23E-08 
1.14E-07 
1.01E-03 
3.04E-04 
6.27E-08 



3.91 E-06 
4.65E-05 
2.63E-07 
2.02E-04 
3.92E-06 
3.01 E-05 
2.35E-06 
2.96E-02 
4.97E-02 
4.60E-04 
1 .93E-05 
2.85E-04 
3.10E-06 
3.10E-06 
1 .68E-02 
2.07E-03 
3.92E-06 
1 .93E-05 
9.10E-05 
3.28E-03 
1 .27E-03 
1.44E-02 
1 .53E-04 
2.36E-05 
2.36E-05 
3.09E-05 
2.36E-05 
7.21 E-02 
3.52E-02 
3.42E-03 

1.49E-02 
1.16E-01 
2.02E-03 
3.82E-05 
1.72E-03 
8.68E-04 
2.67E-05 
3.54E-05 
4.33E-03 
1.43E-03 
1.49E-04 
4.15E-05 
1.44E-04 
5.57E-02 
2.64E-02 
9.10E-05 



PCT7EP2004/0 12470, . 0 
i auie i mid £ 



1.90 10.55 1q24-q25 

1 .92 10.42 Xq26.3 

1.81 10.23 14q21-q22 

1.83 9.791q42.1 

1 .73 9.68 5q35.3 

1 .74 9.60 
1 .69 9.51 

2.01 9.23 9q34.3 

2.09 9.16 

1.71 9.15 17p13.3 

1.63 9.08 17q11.2 

1 .68 9.07 

1.60 9.06 1p1 1.1 

1.60 9.02 19p12 

1.88 9.02 5p 15. 2 

1.69 8.80 

1.55 8.7714q22.2 

1 .55 8.70 

1 .56 8.62 4q24 

1.64 8.47 

1.58 8.40 1q42.2 

1.68 8.25 4p1 6.1 

1.49 8.1915q24.1 
1.45 8.16 

1.45 8.15 5q13 

1.45 8.14 

1 .43 8.09 

1.88 8.09 21q22.11 

1 .73 8.08 

1.53 8.00 1q25.1- 

q31.1 

1.62 7.99 

2.00 7.98 

1.50 7.94 14q23.1 

1.41 7.94 5q3 1.1 
1.48 7.87 4p1 6 

1 .45 7.83 

1.38 7.81 Xp1 1.23 

1.42 7.78 12q1 3.11 
1 .48 7.72 8p21 .2 

1.44 7.71 2q33 
1.38 7.69 1p36.12 

-1.38 -7.6417q21.32 

1 .36 7.60 4q28.1 

1.67 7.58 7p1 4.3 

1 .57 7.57 

1 .34 7.53 20pter- 

q1 1 .23 



WO 2005/043168 
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2.11 Double versus Status 4 



# affyid 

1 215225_s_at 

2 207201_s_at 

3 222583_s_at 

4 218728_s_at 

5 237092_at 

6 236737_at 

7 207114_at 

8 206245_s_at 

9 218538_s_at 

10 202648_at 

1 1 206244_at 

12 205237_at 

13 239246_at 

14 224839_s_at 

15 218778_x__at 

16 218479_s_at 

17 213851_at 

18 214380_at 

19 223672_at 

20 223756_at 

21 203983_at 

22 212490_at 

23 209515_s_at 

24 241493_at 

25 211747_s_at 

26 211742_s_at 

27 24l394_at 

28 223393_s_at 

29 244675_at 

30 230260_s_at 

31 222974_at 

32 217239_x_at 

33 210152_at 

34 205036_at 

35 231216_at 

36 232908_at 

37 202247_s_at 

38 20l540_at 

39 200795_at 

40 220461_at 

41 236239_at 

42 207509 s at 



HUGO name 

GPR17 
SLC22A1 
NUP50 
HSPC163 

FLJ31528 

LY6G6C 

NS1-BP 

MRS2L 

RPS19 

CR1 

FCN1 

GPT2 

EPS8R1 

XP04 

PRPF31 

DKFZp761D221 

FLJ 10081 

TSNAX 

DNAJC8 

RAB27A 



LSM5 
EVI2B 

KIAA1474 
RGS8 
KIAA0084 
IL22 

LILRB4 
LSM6 



MTA1 
FHL1 
SPARCL1 
FLJ 11383 

LAIR2 



fc p 

5.75 4.09E-06 

3.51 2.62E-06 

3.87 3.77E-05 

2.30 4.99E-05 

2.61 8.21 E-06 

3.89 1 .60E-04 

1 .63 1 .09E-04 

3.04 2.55E-05 

1.85 4.16E-05 

-7.1 1 2.64E-04 

9.50 6.73E-03 

5.69 5.48E-05 
2.75 1 .28E-04 
1 .77 8.08E-03 

-5.46 8.36E-05 

1.32 1.25E-04 

2.70 5.48E-04 
2.29 5.11E-04 

4.26 2.25E-03 
5.32 1 .21 E-03 

2.11 1.58E-04 
1.35 7.92E-04 
3.68 1.25E-04 

4.43 2.16E-04 

1.97 4.33E-04 

2.09 1.45E-04 

8.61 1.03E-02 
3.40 3.13E-04 
3.39 5.91 E-04 
3.65 1.29E-03 

7.62 8.59E-03 
8.65 1.44E-02 
4.14 2.12E-04 

2.27 1.97E-04 
2.54 9.91 E-03 

2.12 3.98E-04 
-1.89 2.30E-04 
-5.17 4.48E-04 
2.57 2.92E-03 
2.14 3.55E-04 
1.37 2.22E-03 
2.16 2.02E-03 



stn t 



7 90E-02 




7.90E-02 


6.07 


2.64E-01 


5.93 


2.64E-01 


5.55 


1.06E-01 


4.99 


3.86E-01 


5.72 


3.79E-01 


4.50 


2.46E-01 


4.16 


2.64E-01 


3.78 


4.52E-01 


-4.11 


6.74E-01 


4.79 


2.64E-01 


3.64 


3.79E-01 


3.63 


6.77E-01 


4.80 


3.59E-01 


-3.48 


3.79E-01 


3.25 


4.67E-01 


3.34 


4.67E-01 


3.56 


5.63E-01 


3.50 


4.85E-01 


3.34 


3.86E-01 


3.17 


4.85E-01 


3.59 


3.79E-01 


3.11 


4.39E-01 


3.17 


4.67E-01 


3.16 


3.86E-01 


3.01 


6.88E-01 


3.89 

9M* ■ *jpT 


4.52E-01 


3.02 


4.67E-01 


3.14 

^mr 9 9 W 


4.85E-01 


3.02 


6.77E-01 


3.54 


6.99E-01 


4.36 






4.39E-01 


2.88 


6.85E-01 


3.47 


4.67E-01 


2.80 


4.44E-01 


-2.78 


4.67E-01 


-2.86 


5.87E-01 


2.97 


4.67E-01 


2.79 


5.62E-01 


2.89 


5.34E-01 


2.87 



Map 
Location 
17.64 2q21 

17.13 6q26 

15.55 22q13.31 

14.55 1q42.12 

14.11 

13.53 17q25.3 
11.90 6p21.31 

11.76 1q25.1- 

q31.1 
10.70 6p22.3- 

p22.1 
-10.56 19q13.2 

10.36 1q32 

10.25 9q34 

10.15 

10.01 16q12.1 
-9.82 19q 13.42 

9.17 13q11 

9.13 

9.12 19q13.42 

9.03 1p31.2 

8.92 2p1 2-p1 1 .2 

8.84 1q42.1 

8.78 1p35.2 

8.76 15q15- 
q21.1 . 

8.74 

8.74 7p14.3 
8.50 17q11.2 
8.44 

8.30 19q12 
8.27 1q25 
8.17 3p24.3 
8.17 12q15 
8.14 

8.12 19q13.4 
8.10 4q31.21 
7.91 
7.87 

-7.85 14q32.3 

-7.83 Xq26 
7.78 4q22.1 
7.74 1q42.2 
7.70 

7.68 19q13.4 



WO 2005/043168 

43 219251_s_at 

44 216406_at 

45 234902_s_at 

46 237563_s_at 

47 234938_at 

48 243317_at 

49 240609_at 

50 204083_s_at 
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FLJ 10300 



FLJ20557 



BACH1 



TPM2 



-3.21 
5.56 

-5.67 
2.09 
2.31 
2.49 
1.96 

■13.16 



3.35E-03 
9.93E-03 
2.88E-04 
2.97E-04 
4.73E-04 
3.16E-04 
3.03E-04 
1 .35E-03 



6.17E-01 
6.85E-01 
4.52E-01 
4.52E-01 
4.67E-01 
4.52E-01 
4.52E-01 
4.85E-01 



PCT/EP2004/012470 H 0 
t auie i and i 



•2.91 -7.58 7q36.3 

3.26 7.57 

•2.68 -7.55 19q13.4 

2.67 7.55 

2.68 7.53 21q22.11 
2.68 7.51 

2.65 7.50 

-2.99 -7.47 9p1 3.2- 

p13.1 



2.12 Double versus Status 5 



affy id 


HUGO name 


fc I 


1 239567 at 




4.97 


2 202228 s at 


SDFR1 


2.28 


3 220623 s at 


TSGA10 


4.10 


4 238481 at 


MGP 


4.47 


5 91816_f_at 




1.67 


6 230420_at 




3.86 


7 232738_at 


KIAA1680 


6.89 


8 209981_at 


PIPPIN 


3.09 








I U cSJ f U 1 D S al 


MLUn Int 




11 90^9T7 at 




^ 7 C > 


19 9**fl9fiA c at 


A pi qo 


9 94 

Cm. err 


IO Z I I ^OH- S at 


O ix IN 


9 7Q 


A A 99Q**9n at 




1 7ft 


1C 99QQ19 at 


PI 19^977 




16 235881_at 


rnOU2 


b.Uo 


17 229986_at 




1.63 


18 209230_s_at 


P8 


3.00 


19 206244_at 


CR1 


4.14 


20 204493_at 


BID 


1.87 


21 202462_s_at 


KIAA0801 


1.56 


22 212412_at 




2.23 


23 222569_at 


UGCGL1 


1.82 


24 218109_s_at 


FU14153 


1.47 


25 240609_at 




1.68 


26 221 1 56_x_at 


CPR8 


2.15 


27 2431 14_at 




2.82 


28 236001 _at 




1.70 


29 237714_at 




2.31 


30 212873_at 


HA-1 


1.69 


31 236898_at 




5.08 


32 237221_at 




5.55 


33 232075 at 


REC14 


2.01 



q stn t Map 

Location 



6.11E-06 


1.13E-01 


5.20 


14.90 


2.46E-06 


9.49E-02 


4.77 


14.20 15q22 


1.17E-05 


1.13E-01 


5.03 


14.17 2q11.2 


1 .88E-04 


4.53E-01 


4.74 


13.35 12p13.1- 








p12.3 


1 17F-OA 
1.1/ n.-yjH- 






I £..<J 1 


o.yuc-uo 


I . 1 OC"U 1 




19 M 


9 ni p n**. 


O.O I C-U I 


** 71 








■3 1Q 


Q 1 1 99n1^ 9- 
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2.16 Status 1 versus Status 4 
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43 220833_at 




2.63 


7.16E-04 


6.06E-01 


0.81 


3.99 


44 209703_x_at 


DKFZP586A0522 


1.93 


5.81 E-04 


6.06E-01 


0.80 


3.99 12q13.12 


45 204337_at 




2.37 


6.75E-04 


6.06E-01 


0.79 


3.93 


46 215028_at 


SEMA6A 


3.58 


7.15E-04 


6.06E-01 


0.79 


3.92 5q23.1 


47 211039_at 


CHRNA1 


2.55 


8.87E-04 


6.06 E-01 


0.79 


3.92 2q24-q32 


48 214668_at 


C13orf1 


2.88 


8.41 E-04 


6.06E-01 


0.81 . 


3.92 13q14 


49 206893_at 


SALL1 


3.75 


9.27E-04 


6.06E-01 


0.79 


3.91 16q12.1 


50 209373 at 


BENE 


2.93 


7.90E-04 


6.06E-01 


0.79 


3.91 2q13 



2.22 Status 2 versus normal 



# affyid HUGO name fc p 

1 227935_s_at MGC16202 -1.71 

2 226 1 96_s_at MGC 1 6028 2 .29 

3 204120_s_at ADK 1.78 

4 200014_s_at - HNRPC 1.30 
HG-U133B 

5 218409_s_at DNAJC1 1.65 

6 205372_at PLAG1 -3.49 

7 220296_at FLJ11715 -1.90 

8 2O0021_at- CFL1 -1.16 
HG-U133B 

9 229963_at -4.84 

1 0 230341 _x_at ADAMTS 10 -1 .74 

11 203050_at TP53BP1 1.64 

12 218643_s_at CRIPT 1.82 

13 212151_at -2.68 

14 202972_s_at FAM13A1 -1.40 

15 208426_x_at KIR2DL4 -1.43 

16 214462_at SOCS4 -1.57 

17 224965_at GNG2 -1.85 

18 209014_at MAGED1 1.80 

19 204044_at QPRT 2.94 

20 214290_s_at HIST2H2AA 1.59 

21 235463_s_at LOC253782 1.61 

22 232341_x_at HABP4 -1.52 

23 204141_at TUBB -2.82 

24 21 8829_s_at K1AA1 416 -1 .91 

25 202501_at MAPRE2 1 .57 

26 200029_at - RPL1 9 -1.13 
HG-U133B 

27 2041 97_s_at RUNX3 -1 .62 

28 236248_x_at -2.10 

29 201279_s_at DAB2 1.80 

30 200807_s_at HSPD1 1 .39 

31 205070_at ING3 -1.37 

32 228003_at -1 .55 

33 203162 sat KATNB1 1.62 



q 


stn t 


Map 








Location 


5.98E-07 


2.21 E-02 


-0.66 


-5.64 1 0q23.32 


1.00E-05 


7.43E-02 


0.74 


5.42 14q24.2 


2.96E-05 


9.11 E-02 


0.75 


5.16 10cen-q24 


2.33E-05 


8.61 E-02 


0.66 


5.01 14q11.1 




9.11 E-02 


0.64 


4 91 10d12 31 




7.43E-02 


-0.55 


-4 87 8a12 




7.43E-02 


-0.54 


-4 79 5a33 2 




7.43E-02 


-0.55 


-4 76 11a13 


1 .06E-05 


7.43E-02 


-0.53 


-4.73 


1.41E-05 


7.43E-02 


-0.52 


-4.64 19p13.2 




1.71 E-01 


0.70 


4 62 15a15-a21 




1 .41 E-01 


0.63 


4 60 2d21 


1 R4F-05 


8.49E-02 


-0.52 


-4 59 


2 23E-05 

Cm lt.VU. WV/ 


8.61 E-02 


-0.51 


-4 53 4a22 1 


3 48E-05 


9.18E-02 


-0.52 


-4 50 19a13 4 


5 36E-05 


1.11 E-01 


-0.54 


-4 47 18a22 2 




9.18E-02 


-0.50 




2.50E-04 


1 .77E-01 


0.71 


4.41 Xp11.23 


2.93E-04 


1 .89E-01 


0.76 


4.40 16p12.1 


8.58E-05 


1.38E-01 


0.54 


4.39 1q2 1.2 


1.76E-04 


1 .74E-01 


0.61 


4.39 2q31.1 


4.25E-05 


9.80E-02 


-0.50 


-4.37 9q22.3-q31 


4.24E-05 


9.80E-02 


-0.49 


-4.35 6p21.3 


5.41 E-05 


1.11 E-01 


-0.50 


-4.33 8q12.1 


1 .94E-04 


1.74E-01 


0.59 


4.32 18q12.1 


7.79E-05 


1.31 E-01 


-0.50 


-4.27 17q11.2- 








q12 


6.30E-05 


1 .22E-01 


-0.48 


-4.27 1p36 


6.95E-05 


1 .28E-01 


-0.48 


-4.25 


1.71 E-04 


1 .74E-01 


0.53 


4.21 5p13 


2.13E-04 


1 .74E-01 


0.55 


4.20 2q33.1 


1 .02E-04 


1 .50E-01 


-0.49 


-4.20 7q31 


7.67E-05 


1 .31 E-01 


-0.47 


-4.19 


3.27E-04 


1 .89E-01 


0.60 


4.18 16q12.2 
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34 20521 5_at 


RNF2 


1.51 


1 .57E-04 


1 .74E-01 


0.51 


4.17 1q25.2 


35 22801 1_at 


LOC1 37392 


3.49 


5.24E-04 


1 .90E-01 


0.72 


4.15 8q21.3 


36 229971_at 


GPR114 


1.96 


2.92E-04 


1 .89E-01 


0.56 


4.13 16q12.2 


37 203830_at 


NJMU-R1 


1.75 


3.73E-04 


1 .89E-01 


0.58 


4.10 17q1 1.2 


38 213804_at 


INPP5B 


1.43 


2.38E-04 


1.77E-01 


0.52 


4.10 1p34 


39 227860_at 


CPXM 


2.43 


5.11E-04 


1 .90E-01 


0.63 


4.08 20p12.3- 














p13 


40 222451_s_at 


ZDHHC9 


1.95 


3.72E-04 


1 .89E-01 


0.56 


At A*^. A*^ A^S. 

4.08 9 


41 229072_at 




-2.52 


1.10E-04 


1 .56E-01 


-0.46 


At 0*. 

-4.08 


42 224617_at 


ROD1 


1.35 


3.31 E-04 


1.89E-01 


df\ ^ m At 

0.54 


m MWk\ *mm Am* a*^ jm* 

4.07 9q32 


43 235346_at 


MGC51029 


1.40 


2.22E-04 


1 .74E-01 


0.50 


a\ a*K. A*^. at A J 

4.06 Xp1 1 .3 


44 235556_at 




-1.45 


1.48E-04 


1 .74E-01 


-0.46 


-4.05 


45 210479_s_at 


RORA 


-2.18 


1 .30E-04 


1.71E-01 


-0.45 


At A^S A+\, At ^ m A**. At A*** A^\ 

-4.03 15q21-q22 


46 218395_at 


FLJ 13433 


1.60 


4.50E-04 


1 .90E-01 


0.56 


4.01 12q23.2 


47 211296_x_at 


UBC 


-1.13 


1 .63E-04 


1 .74E-01 


-0.46 


-4.01 12q24.3 


48 202862_at 


FAH 


1.95 


6.40E-04 


1.91E-01 


0.62 


3.99 15q23-q25 


49 214697_s_at 


ROD1 


1.54 


4.84E-04 


1 .90E-01 


0.56 


3.99 9q32 


50 224618 at 


ROD1 


1.58 


4.28E-04 


1 .90E-01 


0.54 


3.99 9q32 



2.23 Status 3 versus Status 4 



# affy id HUGO name fc p q stn t Map 

Location 



1 36545_s_at 


KIAA0542 


-1.58 


7.51 E-07 


4.75E-03 


-1.49 


-8.09 22q12.2 


2 222753_s_at 


FLJ22649 


1.96 


4.09E-08 


1 .51 E-03 


1.23 


7.17 4q34.2 


3 212608_s_at 




-1.60 


4.09E-05 


3.52E-02 


-1.36 


-6.91 


4 221387_at 


OT7T022 


2.38 


9.83E-08 


1 .82E-03 


1.18 


6.89 10q21-q22 


5 239652_at 




2.67 


2.51 E-07 


2.32E-03 


1.17 


6.76 


6 227227_at 




-1.84 


2.44E-05 


3.20E-02 


-1.27 


-6.69 


7 216117_at 




2.99 


2.29E-07 


2.32E-03 


1.16 


6.63 


8 238109__at 




-1.91 


7.71 E-07 


4.75E-03 


-1.10 


-6.35 


9 45297_at 


MGC45806 


-3.80 


1.08E-03 


1 .25E-01 


-1.50 


-6.24 1p35.2 


10 219251_s_at 


FLJ 10300 


-1.89 


2.92E-06 


1 .08E-02 


-1.10 


-6.23 7q36.3 


11 22833 1_at 


SELH 


-1.79 


1.14E-05 


2.18E-02 


-1.12 


-6.18 


12 24471 2_at 




-2.45 


2.35E-04 


6.63E-02 


-1.26 


-6.14 


13 225180_at 


FLJ00166 


-1.89 


2.74E-04 


7.14E-02 


-1.25 


-6.08 3q27.2 


14 219595_at 


ZNF26 


-1.33 


1.16E-06 


6.15E-03 


-1.04 


-6.01 12q24.33 


15 229923_at 




-1.52 


3.25E-04 


7.56E-02 


-1.20 


-5.86 


16 238346_s_at 


NCOA6IP 


1.58 


1 .92E-06 


8.47E-03 


1.00 


5.83 8q1 1 


17 23059 1_at 




3.74 


2.06E-06 


8.47E-03 


1.00 


5.81 


18 206607_at 


CBL 


1.98 


8.65E-06 


1 .98E-02 


1.03 


5.80 11q23.3 


19 217526_at 




-1.48 


8.16E-05 


4.84E-02 


-1.08 


-5.73 


20 22034 1_s_at 


LOC51149 


-1.89 


3.87E-05 


3.49E-02 


-1.05 


-5.72 5q35.3 


21 220390_at 


FLJ23598 


-1.65 


1.44E-04 


5.86E-02 


-1.09 


-5.67 11p11.12 


22 213851_at 




2.38 


3.81 E-06 


1.19E-02 


0.97 


5.66 


23 221686_s_at 


RECQL5 


-1.76 


1.15E-04 


5.33E-02 


-1.08 


-5.65 17q25.2- 














q25.3 


24 207707 s at 


SEC13L1 


1.50 


3.38E-05 


3.21 E-02 


1.02 


5.62 3p25-p24 
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25 219833_s_at 


FLJ 10466 


-1.67 


1 .08E-04 


5.32E-02 


-1.06 


-5.61 6p12.1 


26 232901_at 


LOC57038 


3.15 


3.86E-06 


1.19E-02 


0.96 


5.58 6q16.1 


27 201280_s_at 


DAB2 


-2.05 


1 .22E-03 


1.31E-01 


-1.25 


-5.55 5p13 


28 221942_s_at 


GUCY1A3 


-2.43 


9.17E-05 


5.14E-02 


-1.04 


-5.55 4q31.1- 














q31.2 


29 226959_at 




-2.72 


1 .42E-03 


1.39E-01 


-1.26 


-5.53 


30 243886_at 




-2.01 


6.14E-05 


4.37E-02 


-1.01 


-5.51 


31 242491_at 


SMA5 


-1.61 


4.01 E-04 


8.47E-02 


-1.11 


-5.50 5q13 


32 230589_at 




-2.10 


6.38E-05 


4.38E-02 


-1.00 


-5.46 


33 22881 7_at 




-1.38 


9.10E-06 


1 .98E-02 


-0.94 


-5.44 


34 217323_at 


HLA-DRB6 


3.18 


2.72E-05 


3.20E-02 


0.96 


5.40 6p21 .3 


35 238106_at 




2.77 


7.09E-06 


1.98E-02 


0.92 


5.37 


36 231896_s_at 


DENR 


-1.42 


3.38E-04 


7.56E-02 


-1.05 


-5.33 12q24.31 


37 202885_s_at 


PPP2R1B 


3.36 


8.75E-06 


1.98E-02 


0.91 


5.31 11q23 


38 212221_x_at 




-1.86 


2.38E-03 


1.73E-01 


-1.28 


-5.30 


39 200084 at- 


SMAP 


-1.39 


1 .95E-04 


6.20E-02 


-1.01 


-5.29 11 p1 5.1 


HG-U133B 














40 222244_s_at 


FLJ20618 


-1.36 


1 .00E-04 


5.20E-02 


-0.97 


-5.28 22q1 2.2 


41 204939_s_at 


PLN 


3.40 


8.75E-06 


1 .98E-02 


0.91 


5.28 6q22.1 


42 217346_at 




1.68 


1 .07E-04 


5.32E-02 


0.98 


5.28 


43 236695_at 




2.88 


9.95E-06 


2.05E-02 


0.90 


5.27 


44 235195_at 




-1.78 


9.10E-04 


1.13E-01 


-1.11 


-5.27 


45 219964_at 


ST7L 


-1.78 


2.71 E-05 


3.20E-02 


-0.92 


-5.24 1p13.1 


46 238588_at 




-1.77 


1.76E-04 


6.09E-02 


-0.98 


-5.23 


47 22371 6_s_at 


ZNF265 


-1.64 


6.12E-04 


1.03E-01 


-1.06 


-5.23 1 p31 


48 201005_at 


CD9 


-2.62 


6.10E-04 


1.03E-01 


-1.06 


-5.22 12p13.3 


49 220530_at 




4.48 


1.18E-05 


2.18E-02 


0.90 


5.21 


50 217239 x at 




5.80 


1 .53E-05 


2.65E-02 


0.94 


5.21 



2.24 Status 3 versus Status 5 



# affy id HUGO name fc 

1 243322_at 4.13 

2 222461_s_at HERC2 2.96 

3 238067_at FLJ20298 16.08 

4 2 1 5732_s_at DTX2 2.18 

5 206294_at HSD3B2 3.33 

6 215323_at 2.64 

7 208801_at SRP72 -1 .16 

8 230982_at 3.38 

9 244854_at 3.35 

10 244858_at 3.24 

11 240691_at 4.46 

12 23101 0_at PRO0971 1 .64 

13 21 8489_s_at ALAD -2.71 

14 206936_x_at NDUFC2 1.55 

15 207834 at FBLN1 2.65 



q stn t Map 

Location 



6.00E-07 


2.19E-02 


1.08 


6.32 


3.60E-06 


2.31 E-02 


1.03 


5.96 15q13 


1 .93E-06 


2.19E-02 


1.06 


5.95 Xq22.2 


1.18E-05 


4.81 E-02 


1.04 


5.87 7q1 1 .23 


1 .85E-06 


2.19E-02 


0.98 


5.78 1p13.1 


1 .95E-06 


2.19E-02 


0.97 


5.76 


2.71 E-06 


2.31 E-02 


-0.96 


-5.65 4q1 1 


2.27E-05 


5.89E-02 


1.01 


5.63 


3.70E-06 


2.31 E-02 


0.94 


5.57 


4.52E-06 


2.31 E-02 


0.94 


5.52 


4.62 E-06 


2.31 E-02 


0.94 


5.50 


7.35E-06 


3.30E-02 


0.94 


5.49 4q25 


2.03E-04 


1 .55E-01 


-1.07 


-5.46 9q34 


1 .60E-05 


5.59E-02 


0.92 


5.34 11q13.3 


1 .62E-05 


5.59E-02 


0.89 


5.21 22q13.31 
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16 229087_s_at FLJ14775 2.52 

17 219736_at TRIM36 6.95 

18 233395_at 1.46 

19 218121_at HMOX2 1.76 

20 244692_at FLJ39501 3.69 

21 203453_at SCNN1A 2.77 

22 214668_at C13orf1 2.41 

23 230987_at 2.09 

24 239849_at 3.52 

25 206159_at GDF10 2.65 

26 214408_s_at RFPL3S 1.83 

27 243155_at 3.36 

28 231073_at 2.35 

29 216651_s_at GAD2 3.27 

30 215270_at LFNG 3.27 

31 235187_s_at 2.74 

32 228950_s_at FLJ23091 3.13 

33 214893_x_at HCN2 2.86 

34 211132_at FLJ21919 2.49 

35 215802_at 3.33 

36 20831 4_at RRH 1.57 

37 238933_at IRS1 3.02 

38 243812_at RABL4 2.14 

39 23071 7_at 2.95 

40 241489_at 2.21 

41 243839_s_at 1.91 

42 234840_s_at OR5V1 2.27 

43 215028_at SEMA6A 2.81 

44 236870_at 2.94 

45 209373_at BENE 3.04 

46 204337_at 2.66 

47 243585_at 2.13 

48 207952_at IL5 3.08 

49 219793_at SNX16 1.65 

50 214823 at ZNF204 1.84 



PCT/EP2004/0 12470 . - 
i aoie i and 2 



7.87E-05 


1.18E-01 


0.93 


5.17 17q25.1 


1 .84E-05 


5.89E-02 


0.87 


5.05 5q22.2 


2.46E-05 


5.89E-02 


0.86 


5.03 


1.87E-04 


1 .55E-01 


0.93 


5.01 16p13.3 


2.30E-O5 


5.89E-02 


0.85 


5.00 19p13.11 


2.09E-05 


5.89E-02 


0.85 


4.99 12p13 


2.49E-05 


5.89E-02 


0.83 


4.90 13q14 


1 .88E-04 


1 .55E-01 


0.89 


4.88 


5.30E-05 


9.52E-02 


0.84 


4.86 


4.44E-05 


8.46E-02 


0.83 


4.86 10q11.21 


2.96E-04 


1 .85E-01 


0.91 


4.85 22q1 2.3 


3.43E-05 


7.55E-02 


0.82 


4.80 


3.53E-05 


7.55E-02 


0.81 


4.78 


3.78E-04 


2.17E-01 


0.90 


4.78 10p11.23 


4.17E-05 


8.46E-02 


0.81 


4.76 7p22 


4.52E-05 


8.46E-02 


0.79 


4.70 


6.25E-05 


1 .04E-01 


0.81 


4.65 1p31.2 


6.50E-05 


1.04E-01 


0.80 


4.63 19p13.3 


5.54E-05 


9.57E-02 


0.78 


4.63 1q21.3 


1.05E-04 


1 .38E-01 


0.79 


4.59 


1 .97E-04 


1 .55E-01 


0.81 


4.58 4q25 


7.72E-05 


1.18E-01 


0.80 


4.58 2q36 


1.94E-04 


1 .55E-01 


0.81 


4.56 22q13.1 


2.31 E-04 


1 .62E-01 


0.81 


4.53 


8.25E-04 


2.76E-01 


0.88 


4.52 


2.17E-04 


1 .60E-01 


0.80 


4.52 


8.98E-05 


1 .30E-01 


0.76 


4.48 6p21 .32 


1.61 E-04 


1 .50E-01 


0.77 


4.45 5q23.1 


9.50E-05 


1 .33E-01 


0.75 


4.44 


1 .83E-04 


1 .55E-01 


0.77 


4.43 2q13 


1.05E-04 


1.38E-01 


0.75 


4.42 


1.24E-04 


1 .39E-01 


0.75 


4.42 


1 .07E-04 


1 .38E-01 


0.74 


4.40 5q31.1 


7.25E-04 


2.63E-01 


0.83 


4.40 8q21.12 


1.27E-04 


1 .39E-01 


0.75 


4.39 6p21 .3 



2.25 Status 3 versus normal 



# affyid 

1 214698_at 

2 214697_s_at 

3 203124_s_at 

4 234863_x_at 

5 217683_at 

6 209458_x_at 

7 211745 x at 



HUGO name 

ROD1 
ROD1 
SLC11A2 
FBX05 

HBA1 
HBA2 



fc 



stn 



1.95 
1.66 
•2.27 
■2.17 
•3.37 
•1.80 
-1.75 



8.94 E-09 
2.12E-07 
3.71 E-08 
4.37E-08 
4.55E-08 
1.03E-07 
9.82E-08 



1 .91 E-04 
3.49E-04 
2.44E-04 
2.44E-04 
2.44E-04 
3.14E-04 
3.14E-04 



0.84 
0.70 
-0.64 
-0.63 
-0.63 
-0.64 
-0.63 



Map 
Location 
7.08 9q32 

6.04 9q32 

-6.04 12q13 

-6.00 6q25-q26 

-5.99 

-5.90 16p13.3 
-5.89 16p13.3 
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8 237336_at 


ADD2 


9 211396_at 


PCGR2B 


10 22961Q_at 


rLJ4uozy 


11 211699__x_at 


HBA1 


1 2 56748_at 
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